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S cientists have an inherent interest in claiming their contribu¬ 
tions to the scholarly record, but the fragmented state of iden¬ 
tity management across the landscape of astronomy, physics, 
and other fields makes highlighting the contributions of any 
single individual a formidable and often frustratingly complex task. 
The problem is exacerbated by the expanding variety of academic 
research products and the growing footprints of large collaborations 
and interdisciplinary teams. In this essay, we outline the benefits of 
a unique scholarly identifier with persistent value on a global scale 
and we review astronomy and physics engagement with the Open 
Researcher and Contributor iD (ORCID) service as a solution. 

1 Introduction 

The scientific record reflects an enduring conversation among researchers around 
the globe and across the millenniums, from Aristotle and Ptolemy through 
Maxwell and Einstein and up to the present. Each day, new voices join the 
conversation and new ideas are expressed in articles, books, software, data 
and other products created by scientists and conserved by libraries and other 
repositories [1]. 

Digital technologies have accelerated the pace of this discussion.^ Open 
access repositories such as arXiv.org provide liquidity for the marketplace of 

^Consider the 2014 claimed discovery of primordial gravity waves in cosmology, where 
theoretical papers [e.g., 2, 3] interpreting the BICEP2 observations appeared on arXiv.org 
within hours of the experimental results themselves [4]. 
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ideas, and public social media outlets support open, informal exchanges among 
colleagues in real time. Scientists find value in the efficiency of such digital-only 
discourse, even if preservationists may fret over its fragility [5]. 

The conduct of scholarship relies on the ability to register, identify, and 
persistently access particular snippets of this massive conversation [6]. Digital 
object identifiers [DOIs, 7, 8] help identify what was said by providing unique, 
persistent web links to refereed papers, source materials, data, codes, videos, 
and many other types of content. 

Identifying who contributed to the conversation is much harder [9-12]. While 
journal papers and other media list contributing author names, typically with 
institutional affiliations, this information is often insufficient to uniquely iden¬ 
tify any single individual, especially over time. The community of scientific 
researchers - indeed, the scholarly community as a whole - lacks the functional 
equivalent of a DOI for scholarly identity, a persistent identifier that uniquely 
links any researcher to her or his body of contributions. 

This is not to say that scholarly identity systems don’t exist today. Anyone 
who has published in a major journal, posted to arXiv.org, applied for science 
agency funding, or added content to an institutional or public repository has 
an identifier for that particular platform. The problem is that such balkanized 
identities make it nearly impossible to form a complete view of a person’s 
contributions. In addition, for-profit companies that host scholars’ identities 
may seek to monetize that content in ways that run counter to the open culture 
of scientific inquiry. 

In this article, we review the benefits to researchers, particularly those early 
in their careers, of possessing a unique, non-commercial identifier that breaks 
through existing silos. After reviewing the fragmented landscape of scholarly 
identity in §2, we describe the Open Researcher and Contributor iD (ORCID) in 
§3 and discuss ongoing integration efforts relevant to the domains of astronomy 
and physics. We briefly discuss ORCID in a broader context of Internet identity 
in §4 before summarizing in §5. 

We do not attempt to offer general advice on managing a professional profile 
[though for some concise guides, see 13, 14]. Nor do we venture into cre- 
dentialing, the realm of authorization and authentication to Internet services 
[e.g.,InCommon, 15], although potential “cross-walking” applications are men¬ 
tioned below. Rather, we seek to motivate the case for linking each researcher 
with the scholarly products created by her or him in a unique, persistent, and 
global manner, and present ORCID as a solution. 

The authors of this paper include three ORCID ambassadors, a group who 
seeks to “encourage the adoption of ORCID identifiers locally and globally” [16]. 

2 A Complex Identity Landscape 

Each of us is accustomed to having a unique civilian identity based on a valid birth 
certificate, passport, or social security number issued by a governmental authority. 
In contrast, beyond degree certification documents, professional scholars have 
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Figure 1: A schematic view of layers across which a researcher typically operates 
using multiple identities. The innermost layer involves identity at one’s host 
institution while a domain layer involves interactions with professional 
organizations, projects and societies. The products layer adds publishing 
houses and hosting services for articles, data, code, and other scholarly prod¬ 
ucts. The broadening scope and scale makes it harder to uniquely identify 
an individual in the outer layers. Uniting the layers with a single scholarly 
identifier - an ORCID iD - clarifies the connections between repository 
items and the projects, funding sources, and individuals responsible for 
creating them. 


no uniform mechanism to assign a unique identifier to an individual. Instead, 
layers of tangled identities develop around us. 

2.1 Layers of identities 

Researchers in astronomy, physics or similar disciplines work in a larger ecosystem 
of scholarly endeavor that we present as a set of three layers in Figure 1. This 
representation is simplistic, and not intended to be complete, but it serves to 
separate the organizations that scientists typically interact with into different 
functional roles. 

The institution layer consists of a researcher’s home institution and its 
internal components. At this level, digital identity is, by and large, managed 
by the central information technology organization of the organization. Even 
so, multiple identities can exist here due to internal systems that have not been 
centrally aligned or due to free-floating services (wikis, databases, etc.) that 
host their own authentication. 

Outside the institution, a domain layer includes organizations that serve 
and fund science, as well as the facilities and collaborations that generate new 
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discoveries. Interacting with the information systems in this layer typically re¬ 
quires creating separate identities for each organization, although an institutional 
e-mail address often serves as a common naming device. Funding agencies and 
most scientihc societies are active at the national level, whereas large projects 
and facilities, and transnational societies such as the International Astronomical 
Union, support members from around the globe. The international nature of 
science demands a global identity solution. 

An outer products layer holds the materials generated within the inner two 
layers. This space contains publishing in both traditional and modern forms, 
the former emphasizing commercial publishing concerns and the latter including 
a variety of web publishing formats, many of them open access. Traditional 
companies such as Reed-Elsevier and Thomson-Reuters have developed their 
own mechanisms for identifying individual authors, and these identities inform 
their commercial research analytics platforms Scopus [17] and Web of Science 
[18], respectively. 

While peer-reviewed literature published in established journals remains 
a dominant force across physics and astronomy, the open market created by 
arXiv.org rivals in importance because it provides real-time access to new science. 
The arXiv, which displaced the distribution of physically mailed pre-prints with 
an open-access e-print repository, is complemented by the abstract and indexing 
services provided by INSPIRE [19] and ADS [20]. These sites emerged in the 
early days of the World-Wide Web and are now heavily used by astronomers 
and physicists as communication, discovery and assessment tools. 

Newer services in the products layer host and distribute code, data, and other 
digital products such as presentation slides, videos, and lab notebooks. Some 
sites focus on the needs of a particular science domain while others, including 
the new non-commercial service Zenodo [21] or commercial software repositories 
such as Github [22] or Bithucket [23], serve a much broader audience beyond 
astronomers and physicists. 

The landscape charted in Figure 1 is by no means exhaustive, but most 
practicing astronomers and physicists at any rank should hnd its elements 
familiar. Accrediting and linking contributions across the entire landscape of 
Figure 1 is a rational desire that would benefit both individuals and organizations. 
From that basic concept, we now turn to offer a few practical reasons to care 
about a unique scholarly identity. 

2.2 Practical motivations for unique scholarly identity 

Scientists are professionals who need to advance their careers, help deserving 
colleagues move forward in their careers, and generally take part in the larger 
workings of scientific society that includes evaluating, crediting and criticizing 
the work of others. Here are some common activities that a unique identifier 
can help support. 

o Claim appropriate credit. For two people in the same field sharing a 
name, the opportunity for disambiguation serves as the simplest motivation 


Page 4 of 13 



for a unique identifier. The pair of Jens Jensens at the Niels Bohr Institute 
in Copenhagen would certainly benefit. So too wonld the more than 100 Y. 
Zhangs who are registered to arXiv.org [10]. The difficulty in identifying 
and evaluating publications authored by someone with a common name 
often frustrates writers of recommendation letters. 

o Manage life transitions. Circumstances such as marriage can lead to 
a change in one’s name. In the present, fragmented state of affairs, the 
responsibility is entirely on the researcher to inform all the elements in 
the academic landscape of such a change. How easy will it be for others 
to view the full body of work for someone experiencing this personal 
transition? Will discovery services like ADS and INSPIRE automatically 
extend searches across the new and old names? A unique ID associated 
with both names is a simple mechanism to achieve continuity through such 
transitions. 

o Manage career migration. Researchers in the early stages of an aca¬ 
demic career are likely to change institutional affiliation multiple times 
as their careers advance (as nicely illustrated for dotAstronomy members 
by Daniel Foreman-Mackey [24]). Some will exit the academic profession 
and transition to other employment. A persistent identifier that follows 
a person through these career transitions would allow that individual’s 
contributions to the products layer to be easily consolidated over time and 
across multiple locations, including those in industry or commerce. 

o Be socially responsible. Linking scientihc output to taxpayer investment 
is a socially responsible thing to do. In 2013, the Office of Science and 
Technology Policy in the United States directed federally-funded agencies 
“to support increased public access to the results of research funded by 
the Federal Government” [25], and this mirrors a similar announcement 
by the European Commission [26]. A collection of repositories of open 
access products tagged with ORCID identifiers constitutes a knowledge 
fabric capable of supporting these directives by facilitating links between 
materials in the products layer, funding sources and projects in the domain 
layer, and grant recipients within institutions. 

o Stand out in a crowd. Scientific research is increasingly collaborative 
[e.g., 27], but working in large teams on long-term projects is viewed as 
potentially detrimental to early career scientists. Will a graduate student’s 
contributions to an alphabetically-ordered publication with a hundred 
or more authors be appropriately recognized? While the promotion and 
tenure process still relies heavily on letters from expert reviewers [28], 
so-called “arms-length” letter writers may themselves be unaware of an 
individual’s specific contributions. For projects that operate transparently 
in the products layer, a unique author ID that labels the codes built, data 
sets created, and presentations made by a young researcher can bring those 
many contributions into focus. 
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3 Open Researcher and Contributor ID (ORCID) 

In 2010, a group of publishers, academics, and librarians incorporated a non-profit 
organization with a mission to solve the name ambiguity problem in scholarly 
communication. The Open Researcher and Contributor iD [29] solution takes 
the form of a central registry of unique identihers, known as ORCID iDs, for 
individual researchers along with open and transparent mechanisms to link these 
with existing researcher identifier schemes. ORCID principles [30] emphasize 
openness, transparency, broad scholarly scope on a global scale, and respect for 
the right of researchers to set privacy settings on their records. 

The service is governed by a board of sixteen members with representatives 
from multiple areas including publishing, academia and information science. The 
American Physical Society was a launch partner of ORCID, and the American 
Astronomical Society joined as a sponsor in 2011. Similar to arXiv, a member 
fee model supports the organization’s operations. 

3.1 ORCID Services 

The basic service that ORCID provides is the creation of a unique, 16-digit 
identifier for an individual. The cost of entry is low; a name and e-mail address 
is all that’s needed to create an identifier, with additional information such 
as affiliation and position optional. After an ORCID iD is created, the owner 
has choices to make regarding populating and sharing additional information 
in the ORCID registry. Ideally, all work across the landscape of Fig. 1 would 
be rounded up and used to populate one’s registry, but this goal is only partly 
possible today. We discuss below the status of some automatic ingest capabilities 
for physics and astronomy. 

There are three privacy settings — public, limited access, and private — that 
can be applied in a granular manner to registry contents. Settings are chosen at 
the time of account creation and can be modified at any time. A researcher can 
choose to become “invisible” to the public by marking all fields other than the 
ORCID identifier as private. ORCID does not require a researcher to populate 
and expose registry contents through the orcid.org website. 

Account delegates may be assigned to manage a registry. The application 
program interfaces for ORCID services allow account management to be auto¬ 
mated, and one can control the set of trusted parties that are able to update 
registry contents. 

A large, active list of integration efforts is maintained on the web site’s 
integration page [31]. Efforts relevant to the astronomy and physics communities 
are summarized in Table 1. 

3.2 ORCID integration at academic institutions 

Dozens of universities across the world have enrolled as ORCID members. In the 
US, both private (e.g., Boston University, Caltech, Harvard, and MIT) and public 
(e.g., Texas A&M University, Purdue University, and University of Michigan) 
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Organization 

Various universities 
NSF, DoE, NASA 
AAS, APS 
INSPIRE, ADS 
Zenodo, Github, Eigshare 


Activity 

Assigning ORCID iDs to campus researchers 
Planning for ORCID iDs in grant management 
Using ORCID iDs for journal authors, referees 
Developing ORCID iD integration services 
Support ORCID iD tags for data and code 


Table 1: ORCID engagement within the physics and astronomy communities. 


institutions are at various stages of planning and implementing the assignment 
of ORCID iDs to their researchers. 

The John G. Wolbach Library is actively promoting ORCID within the 
Harvard-Smithsonian Center for Astrophysics and the larger astrophysics com¬ 
munity. The Texas A&M University Library began enrolling junior researchers, 
mainly graduate students and postdocs, with the aim of tagging all doctoral 
dissertations with ORCID iDs. The University of Michigan Library is assigning 
ORCID iDs within the School of Medicine in 2014, with other colleges to follow. 

3.3 ORCID in the domains of physics and astronomy 

Within the US, the American Physical Society (APS) and American Astronom¬ 
ical Society (AAS) have endorsed ORCID as a mechanism for solving author 
disambiguation and for streamlining the workflow of scholarship. APS members 
can apply for an ORCID via the APS Author Profile page [32]. The AAS has 
invited its members to add ORCID iDs to their profile [33] as part of a larger 
digitization strategy that is planned to include DataCite DOIs for data associated 
with AAS-sponsored journal articles [34]. The American Institute of Physics 
and UK-based Institute of Physics are also ORCID members. 

Science agencies across multiple countries, including Japan, the Netherlands, 
Portugal, the UK and the US have called for the use of persistent identifiers for 
researchers involved in grants submissions and management processes. In the 
US, adoption of ORCID iDs is being led by the National Institutes of Health 
[35] , and other agencies are at various stages of planning. The Office of Scientific 
and Technical Information in the US Department of Energy (DoE), the part 
of the agency that maintains and disseminates the results of DoE’s $10 billion 
annual investment in research and development, joined ORCID as a member in 
May 2013. The US National Science Eoundation sees potential benefits in unique 
identifiers, including strengthening its portfolio analysis capabilities, facilitating 
the process of research reviews, and reducing administrative burdens [36]. 

3.4 ORCID in the products layer 

The variety of elements that comprise the products layer continues to expand as 
the digital era of scholarship matures. Indeed, the notion of scholarly products as 
a term to encompass contributions beyond traditional publication forms of journal 
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articles, conference proceedings, and books is itself relatively new. Platforms 
that host the varied outputs of science are now a mix of commercial and open 
access services, the latter including repositories hosted by research universities. 
Due to its complexity and evolving nature, it is difficult to comprehensively 
review ORCID engagement across this layer; we provide here a snapshot of 
activities relevant to the needs of astronomers and physicists. 

In terms of article repositories, ORCID currently supports imports from the 
commercial indexing services SCOPUS and Web of Science. These services focus 
on tracking products in peer-reviewed journal literature. On the other hand, 
physicists and astronomers mainly use non-commercial services for search and 
discovery, namely INSPIRE and ADS, respectively. ORCID integration with 
these services is under development. 

INSPIRE, managed by CERN, currently allows author profiles to be linked 
with ORCID, and it will soon deploy tools to align the ORCID registry with 
INSPIRE records via push/pull operations. INSPIRE will retrieve any missing 
publications from ORCID, and an author claiming system will allow the addition 
of INSPIRE records to an ORCID registry. As of June 2014, roughly 250 
INSPIRE authors have linked ORCID iDs to their records (Laura Rueda, priv. 
comm.). APS reports over 5500 authors and more than 3200 referees registered 
with ORCID iDs (Mark Doyle, priv. comm.). The latter constitutes roughly 10 
percent of the overall referee cohort. 

For astronomers, the NASA ADS service is in the planning stage of ORCID 
integration (Alberto Accomazzi, priv. comm.). As of June 2014, over 1600 
authors on ADS have ORCID iDs (Carolyn Grant, priv. comm.), and 11% of 
authors in AAS journals have ORCID iDs (C. Biemesderfer, priv. comm.). 

The arXiv.org e-Print archive is also moving towards ORCID integration. The 
2014 arXiv Roadmap [37] lists support for ORCID identifiers as a goal and notes 
improved interoperability with authority records from external repositories and 
better mechanisms for reporting institutional statistics to member organizations 
as benefits. 

Beyond articles, ORCID iDs can be integrated into Github as part of a 
new collaboration with the Zenodo service. Zenodo serves as “an open digital 
repository for everyone and everything that isn’t served by a dedicated service; 
the so called ‘long tail’ of research results” [21]. The ODIN project [38] is 
building a tool that will allow ORCID iDs to be added to research products 
associated with DataCite DOIs. The HubZero science collaboration platform 
[39] is also integrating ORCID iDs into its workflows. ORCID identifiers for 
content creators can be included in the citation metadata for datasets in the 
DataCite Metadata Store. The public hosting service Figshare [40] allows users 
to tag their entries with ORCID iDs. 

In short, a modest but growing level of ORCID integration exists in the 
sector of the products layer relevant to astronomers and physicists. 
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4 Discussion 


We have emphasized the benefits of ORCID iDs but there are associated costs and 
potential risks. In a 2014 discussion about ORCID on the Astronomy Facebook 
page, some participants expressed skepticism about having yet another on-line 
identity to maintain, especially if its use is mandated by some organization. 
While there is a time cost for an individual to obtain an iD, it can be reduced to 
under five minutes by setting one’s privacy settings so that all registry content is 
private. This approach avoids having the ORCID web site become yet another 
professional site one needs to maintain. 

The dominant sentiment of the Astronomy Facebook discussion, however, 
was positive, with the most liked response coming from a Japanese researcher 
whose name is easily confused with others. He wrote, 

It just doesn’t make sense to have to wonder about completeness and 
ambiguity when looking up someone’s scientific record. 

Frustration of this kind will be eliminated by wide ORCID adoption. 

As a young organization attempting to solve a complex problem on a global 
scale, ORCID has its share of growing pains. Some recent negative scrutiny 
surrounded an apparent inability to opt out of Google Analytics [41] tracking on 
orcid.org, but this issue was found to a technical fault with the specific blocking 
software being used [42]. 

In fact, ORCID membership is rapidly growing. In mid-November 2014, two 
years after the launch of the ORCID Registry, the organization issued its one 
millionth identifier. Versions of the ORCID site in Russian and Portuguese soon 
followed. 

4.1 ORCID in the broader context of Internet identity 

It is important to recognize that ORCID augments rather than replaces the 
balkanized set of identifies associated with the landscape of Figure 1. Individual 
institutions such as universities and national laboratories will continue to main¬ 
tain identity and access management services based on local identities. But, as 
Keith Hazelton of Internet2 and University of Wisconsin points out, ORCID iDs 
could be used as an identifier to “cross-walk” among appropriately federated 
campus systems [43]. Such a process could make collaborative research more 
efficient by simplifying access to pools of data and computing resources that 
cross institutional boundaries. 

As argued in the Astronomy Facebook page discussion, the registry aspect of 
ORCID is a mixed blessing. On one hand, a site that automatically populates 
itself with the papers published, grants received, codes authored, etc., by an 
individual sounds like a great idea. On the other hand, INSPIRE and ADS already 
provide such a service for publications, and most researchers already expose their 
professional profiles through personal web pages and/or existing commercial 
services offered by the likes of linkedin.com, researchgate.com, mendeley.com. 
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and academia.edu. How does the orcid.org registry relate to these other prohle 
solutions? 

This is a simple question without a simple answer, but we stress that ORCID 
principles emphasize personal control of one’s registry content, so there are a 
number of possible paths. As already mentioned, there is the simple option of 
declaring an ORCID profile private which minimizes its exposure on orcid.org. 
As INSPIRE and ADS integration with ORCID advances, the option of exposing 
publications through orcid.org should become more attractive. The ability to 
claim a broader variety of products such as data, codes, videos and blog posts 
across a set of repositories is also an appealing aspect. 

Finally, we note that there is an independent effort supported by the Interna¬ 
tional Standards Organization (ISO) that also seeks to create global identifiers 
for creative activities. The International Standard Name Identifier [ISNI, 44] 
has a mission to “assign to the public name(s) of a researcher, inventor, writer, 
artist, performer, publisher, etc. a persistent unique identifying number in order 
to resolve the problem of name ambiguity in search and discovery.” ISNI and 
ORCID use the same 16-digit identifier format. In early 2014, the two organi¬ 
zations signed a memorandum of understanding (MOU) to develop a strategic 
partnership [45]. The following paragraphs from the MOU help differentiate the 
two organizations. 

The ISNI identifier is a global standard for identifying the millions of 
contributors to creative works and those active in their distribution, 
including writers, artists, creators, performers, researchers, producers, 
publishers, aggregators, and more across the global information 
supply chain. It is part of a family of international standard identifiers 
that includes identifiers of works, recordings, products and right 
holders in all repertoires, e.g., DOIs, IS AN, ISBN, ISRC, ISSN, 
ISTC, and ISWC. 

ORCID is an open and interdisciplinary community-based effort to 
provide a self-claim registry of unique researcher identifiers. To ensure 
the identifier links researchers with their works, ORCID works with 
the research community to embed these identifiers in workflows, such 
as manuscript submission, grant application, and dataset deposition. 
ORCID is unique because of its direct relationship with researchers 
and with organizations throughout the research community. 

Simply (if somewhat imprecisely) put, ISNI is aimed at people pursuing broad 
creative activities while ORCID is more focused toward academic researchers. 
ORCID’s existing partnerships with organizations in physics, astronomy, biology 
and other science and engineering areas makes it the natural choice for researchers 
in these domains. The MOU states that ORCID and ISNI “are agreed on the 
need to aim for interoperation through linking identifiers and sharing public 
data between the two systems,” so it is likely that an ORCID iD will effectively 
serve as an ISNI in the near future. 
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5 Summary 

Science is increasingly multidisciplinary and collaborative at an international 
scale. As a result, discovery, attribution and recognition of individual accom¬ 
plishments is getting harder. 

A solution to unique, persistent scholarly identity is required, and the ORCID 
service is now available as an open, non-commercial, cross-disciplinary, global 
solution. The learned societies of physics and astronomy have endorsed ORCID 
from its formative stages and are actively engaged with its adoption. The benehts 
to scientihc society of such a service are too valuable to ignore, and a solution is 
ready. 
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